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(57) According to the present invention, the 
polypeptide of theobromine synthase derived from cof- 
fea arabica and the gene encoding said polypeptide are 
provided. As theobromine synthase participates in bio- 



synthesis of caffeine, caffeineless coffee would be ob- 
tained by preparing a transformed plant, wherein ex- 
pression of gene encoding said enzyme is inhibited. 
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Description 

BACKGROUND OF THE INVENTION 
5 1 . Field of the invention 

[0001] This invention relates to theobromine synthase polypeptide and the gene encoding said enzyme. 
2. Prior art 

w 

[0002] Coffee is a drink consumed alt over the world with favorite and its utility is markedly large. On the other hand, 
it is known that excessive ingestion of caffeine, which is contained in coffee, causes harmful effects. Caffeine is one 
of xanthine derivatives and theophylline and theobromine are also the members of the xanthine derivatives. These 
xanthine derivatives are known to inhibit phosphodiesterase, thereby the amount of cAMP is increased. As the result, 

is xanthine derivatives exhibit excitatory effect on the central nerves system and enhance function of the circulatory 
system. When they are ingested at a suitable amount, such effects of xanthine derivatives are useful for spiritual 
elevation. However, when the amount of digestion is excessive, they would cause harmful effects as mentioned above. 
Therefore, there has been a strong demand on production of a caffeine-less coffee all over the world. 
[0003] To obtain caffeine-less coffee, atlempls to obtain a gene involved in biosynthesis of xanthine derivatives have 

20 been performed, in the purpose to achieve artificial control of biosynthesis of caffeine. In Fig. 1 (cited from Advances 
in Botanical Research, Vol. 30, Academic Press (1999) p149), the pathway working for caffeine biosynthesis in coffee 
plants is shown. In Fig. 1 , the arrow with solid line indicates the main pathway of caffeine synthesis and the arrow with 
dotted line indicates the minor pathway of caffeine synthesis, respectively. As shown in the second line of Fig. 1, the 
pathway operating for biosynthesis of caffeine from xanthosine via 7-methylxanthinc and theobromine has been known, 

25 which is the main pathway for biosynthesis of caffeine biosynthesis in coffee plants. The latter half of the main biosyn- 
thesis pathway of caffeine is composed of three steps of N-methylation reactions. These N-methylation reactions have 
been known to be dependent on S-adenosylmethoinine. There also exists a pathway (third line in Fig. 1) in which 
caffeine is biosynthesized from 7-methylxanthine via para-xanthine, but it is known that contribution of this pathway is 
not significant. With regard to the first methylation reaction to synthesize 7-methylxanthine, a gene encoding an enzyme 

30 responsible for said reaction has been obtained and it has been already reported (International Laid-Open Publication 
No. WO 97/35960). However, genes involved in the second step methylation reaction and the third step methylation 
reaction have not been known yet. For effective and accurate manipulation of caffeine biosynthesis, more knowledge 
on genes that encode enzymes involved in caffeine biosynthesis should be obtained. 

35 SUMMARY OF THE INVENTION 

[0004] The first aspect of this invention is a polypeptide consisting of an amino acid sequence defined by amino acid 
numbers from 1 to 378 shown in SEQ ID NO: 1 in a Sequence List. A polypeptide consisting of an amino acid sequence 
exhibiting at least 90% of homology with SEQ ID NO: 1 is also within the scope of this invention, so far as the polypeptide 

40 has the activity to biosynthesize theobromine using 7-methylxanthine as the substrate. Such sequence may be obtained 
by making deletions, insertions, substitutions or any combinations thereof in the amino acid sequence of SEQ ID NO: 1 . 
[0005] The second aspect of this invention is a gene consisting of a base sequence defined by base numbers from 
1 to 1298 shown in SEQ ID NO: 2 in a Sequence List. A gene that hybridizes with SEQ ID NO: 2 under a stringent 
confdition and a gene consisting of a base sequence exhibiting at least 90% of homology with SEQ ID NO: 2 is also 

45 within the scope of this invention, so far as the gene encodes a polypeptide having the activity to biosynthesize theo- 
bromine using 7-methylxanthine as the substrate. Such sequence may be obtained by making deletions, insertions, 
substitutions or any combinations thereof in the base sequence of SEQ ID NO: 2. 

[0006] The third aspect of this invention is a transformed plant wherein expression of said gene is inhibited in the 
plant to decrease biosynthesis of theobromine and a seed obtained from the transformed plant. Preferably, the plant 
50 to be transformed is selected from the group consisting of Coffea arabica, Coffea canephora, Coffea liberica and Coffea 
dewevrei. 

[0007] The fourth aspect of this invention is a transformed plant wherein said gene is introduced in the plant to 
increase biosynthesis of theobromine and a seed obtained from the transformed plant. Preferably, the plant to be 
transformed is selected from the group consisting of Coffea arabica, Coffea canephora, Coffea liberica and Coffea 
55 dewevrei. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0008] The present invention will be further explained in detail hereafter with reference to the accompanying drawings, 
in which: 

5 

Fig. 1 is a drawing showing the pathway of caffeine biosynthesis ; 

Fig. 2 is a drawing showing base sequences of cDNAs obtained from MTL1 , MTL2, MTL3 and MXMT1 ; 
Fig. 3 is a drawing showing alignment of amino acid sequences obtained from MXMT1 , MTL2 and MTL3 ; 
Fig. 4 is a photograph showing the results of SDS-PAGE analyses performed on fusion proteins obtained from 
10 MTL2, MTL3 and MXMT1 ; 

Fig. 5 is a photograph showing the results of TLC to analyze enzymatic activities of the fusion proteins obtained 
from MTL2, MTL3 and MXMT1 ; and 

Fig. 6 is a chart showing the results of HPLC performed to identify reaction products in the enzymatic reaction 
mixture of the fusion protein obtained from MXMT1 identified by HPLC. 



15 



DETAILED DESCRIPTION OF THE INVENTION 



[0009] The present inventors remarked an enzyme participating to the second methylation step reaction and respon- 
sible for biosynLhesis of theobromine, and they have obtained the gene encoding the enzyme. The enzyme is an 

20 enzyme operating to catalyze biosynthesis of theobromine from 7-methylxanthine. Therefore, when expression of the 
gene encoding said enzyme in inhibited, it would result in decrease of theobromine biosynthesis. In the pathway of 
caffeine biosynthesis, caffeine is synthesized through N-methylation of theobromine. Then when biosynthesis of the- 
obromine is inhibited, biosynthesis of caffeine would be inhibited as well. As described above, theobromine and caffeine 
exhibit similar pharmacological effect as xanthine derivatives. Therefore, isolation of a gene encoding an enzyme, 

25 which enables concurrent manipulation of theobromine biosynthesis and caffeine biosyntheses, has a great signifi- 
cance. That is, if a gene encoding an enzyme responsible for the final step of caffeine biosynthesis, i.e. the third 
methylation step, is isolated, then expression of the gene can be inhibited. As a result, biosynthesis of caffeine would 
be reduced, but biosynthesis of theobromine would not be reduced. Moreover, accumulation of theobromine is expected 
to occur as the metabolism of theobromine is inhibited. Thus, considering that pharmacological effect of theobromine 

30 is similar to that of caffeine, the effect of the present invention, which relates to isolation of a gene encoding theobromine 
synthase, can be estimated to be significant. 

[0010] The present invention relates to theobromine synthase gene derived from Coffea arabica, consisting of a 
base sequence defined by the base numbers 1 to 1298 shown in SEQ.ID. NO:2 in a Sequence List. As described 
above, in coffee plants, theobromine synthase catalyzes methylation reaction at biosynthesis of theobromine using 
35 7-methylxanthine as the substrate. The gene defined by the base sequence described in SEQ.ID. NO:2 in a Sequence 
List is a gene encoding theobromine synthase having such characteristic. 

[0011] According to technique of gene recombination, artificial modification can be achieved at a specific site of basic 
DNA, without alteration or with improvement of basic characteristic of said DNA. Concerning a gene having native 
sequence provided according to this invention or modified sequence different from said native sequence, it is also 

40 possible to perform artificial modification such as insertion, deletion or substitution to obtain gene of equivalent or 
improved characteristic compared with said native gene. Moreover, a gene with such mutation is also included in the 
range of this invention. That is, the gene, consisting of a base sequence hybridizes with said base sequence shown 
in SEQ ID NO: 2 in the sequence list under stringent condition, means a gene in which 1 0 or less, preferably 7 or less, 
and more preferably 3 or less bases of the sequence is deleted, substituted or added to the base sequence shown in 

45 SEQ ID NO: Z in a Sequence List. Moreover, such gene exhibits homology 90% or more, preferably 95% or more and 
still preferably 99% or more with the base sequence shown in SEQ ID NO: 2 in a Sequence List, in addition, such gene 
hybridizes with the base sequence shown in the SEQ ID NO: 2 in a Sequence List under slringenl condition. Such 
gene is also within the range of this invention so far as it encodes a polypeptide having the characteristic as theobromine 
synthase, that catalyzes biosynthesis of theobromine using 7-methylxanthine as the substrate. 

50 [001 2] Furthermore, this invention relates to polypeptide of theobromine synthase derived Coffea arabica, consisting 
of an amino acid sequence defined by the amino acid numbers from 1 to 378 shown in SEQ ID NO: 1 in a Sequence 
List. The polypeptide consisting of an amino acid sequence in which a part of said polypeptide defined by amino acid 
sequence shown in SEQ ID NO: 1 is deleted, substituted or added with another amino acid sequence means a polypep- 
tide in which 10 or less, preferably 7 or less, and more preferably 3 or less amino acids of the sequence is deleted, 

55 substituted or added to the amino acid sequence shown in SEQ ID NO: 1 in a Sequence List. Moreover, such polypeptide 
exhibits homology 90% or more, preferably 95% or more and still preferably 99% or more with the amino acid sequence 
shown in SEQ ID NO: 1 in a Sequence List. Such polypeptide is also within the range of this invention so far as it 
exhibits characteristic as theobromine synthase, that catalyzes biosynthesis of theobromine using 7-methylxanthine 
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as the substrate. Incidentally, the polypeptides shown in SEQ.ID. NO:3, SEQ.ID. NO:5 and SEQ.ID. NO:7 in a Sequence 
List can be obtained from coffee arabica (Coffea arabica), and the polypeptides have higher than 80% of homology 
compared with the amino acid sequence of SEQ.ID. NO:1 in a Sequence List. These three polypeptides did not exhibit 
activity as theobromine synthase, despite of high homology to SEQ.ID. NO:1 in a Sequence List. 
[0013] A transformed plant, in which expression of theobromine synthetase gene described in SEQ.ID. NO:2 in a 
Sequence List is inhibited to decrease biosynthesis of theobromine, is also within the scope of the present invention. 
The theobromine synthase gene of the present invention is, as mentioned above, a gene encoding an enzyme involved 
in biosynthesis of theobromine in coffea arabica. Thus, by inhibiting expression of the gene according to the present 
invention, biosynthesis of theobromine is assumed to decrease in a plant, whereby it enables decrease of theobromine 
content and caffeine content in the plant. As a plant of the target in which expression of theobromine synthase gene 
of the present invention is inhibited, coffee plants such as Coffea arabica, Coffea canephora, Coffea liberica and Coffea 
dewevrei and the like can be exemplified. 

[0014] In these plants, by inhibiting expression of the gene of the present invention, biosyntheses of theobromine 
and caffeine would be reduced. As a means for inhibiting expression of the gene of the present invention, a method 
utilizing an antisense gene (antisense gene method) can be adopted. The antisense gene means a gene that expresses 
a base sequence complementary to mRNA, a transcription product of DN A constituting a certain gene. The transcription 
product of the antisense gene is complementary to an inherent mRNA, then the antisense gene can inhibit gene ex- 
pression at the stage of translation. By utilizing this technique, expression of theobromine synthase gene can be in- 
hibited. 

20 [0015] In addition, other methods that can inhibit expression of a gene have been known . By destruction of a targeted 
gene, expression of the gene can be inhibited. Moreover, in a plant, technique of co-suppression (trans witch technique) 
has been known. According to the technique, expression of the targeted gene can be inhibited by phenomenon of gene 
interference, even when sense gene is introduced and over-expressed. Moreover, it has been reported in recent years 
that Doublcstranded RNA interference (RNAi) method using a double stranded RNA is effective to inhibit expression 
of a gene (Chiou-Fen Chuang et al. PNAS (2000) vol. 97, 4985-4990). It has been demonstrated that a double strand 
RNA can inhibit expression of a gene in a sequence specific manner, according to the research mainly utilizing nem- 
atodes (C.elegans) or fruit fly. In the RNAi method, such double strand RNA is utilized and it has been recently dem- 
onstrated that the method is effective for not only nematodes or fruit fly but also for plants such as Arabidopsis thaliana 
Heynh. The mechanism involved in inhibition of gene expression by the RNAi method is not known yet. However, this 
method would enable inhibition of expression of a gene, with higher efficiency compared with the above-mentioned 
antisense method. 

[0016] By the way, purine alkaloids such as caffeine and theobromine, can exhibit effect to avoid insects and the 
effect is considered to be the existence value of purine alkaloids in a plant. Thus, the gene of the present invention 
can be introduced in a plant and biosythesis of theobromine can be increased in the plant, whereby the plant body 
would exhibit insect-avoiding activity. As described above, the enzyme of the present invention is responsible for bio- 
synthesis of theobromine using 7-methylxantine as the substrate. Therefore, it is assumed that, when the above-men- 
tioned gene encoding the 7-methylxanthine synthase (International Laid-Open Publication WO 97/35960) and the gene 
of the present invention are introduced into a plant concurrently, the effect would be particularly significant. When the 
activity of 7-methylxanthine synthase is enhanced, the amount of substrate available for the enzyme according to the 
present invention would be increased. As a result, accumulation of theobromine, which is the objective product is 
expected to occur. 

[0017] As a method to produce a transformant, a method generally well known in this art can be adopted. A vector 
available for the present invention may include plasmid vectors, for example pBI121 can be exemplified, but the scope 
the vector is not to be limited to them. Such vector can be introduced into, for example, Agrobacterium. Then the 

45 bacteria can be utilized for infection of callus or plantlets, resulting in production of transformed plants. Furthermore, 
it is possible to obtain seeds derived from such transformed plants. In Japanese Laid-Open Patent Application No' 
2000-245485, the present inventors have reported a method comprising infection of an embryogenic callus of a coffee 
plant by Agrobacterium tumefaciens EHA1 01 and the method enables transformation of coffee plants with high efficacy. 
The method for transformation described in Japanese Laid-Open Patent Application No. 2000-245485 is assumed to 

50 be particularly useful. 

EXAMPLES 
(Amplification by PCR) 

[0018] A pair of degenerate oligonucleotide (Forward primer, GGITGYDSIDSIGGICCIAAYAC; Reverse primer, AR- 
IYKIYYRTRRAAISWICCIGG) was synthesized, based on the region conserved among TCS1 (Kato et al., 2000 Gen- 
Bank accession no. AB031280) and two proteins (Z99708 and AC008153), with their functions unknown, of Arabidopsis 
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thaliana. These oligonucleotides correspond to amino acid sequences of GC(A/S)(A/S)GPNT and PGSF(H/Y)(G/K) 
(R/N)LR respectively. In a 25 uJ of reaction mixture containing Coffea arabica cDNA and the above-mentioned primer 
pair, PCR was performed under the conditions described below. That is, after reaction at 94°C for one minute, 30 cycles 
of denaturation at 94°C for 30 seconds, annealing at 52°C for 30 seconds and extension at 72°C for one minutes was 
5 performed, which was followed by a final extension at 72°C for 7 minutes, whereby the PCR reaction was completed. 
The amplified cDNA fragment of about 270 base pairs was used for screening of cDNA library. 

(cDNA library construction and screening) 

10 [0019] Total RNA was extracted from young leaves of coffee (Coffea arabica) and it was purified to mRNA by oligo- 
dT column (Pharmacia). cDNA was synthesized from mRNA using ZAPII cDNA synthesis kit (Stratagene), it was in- 
troduced into XZAPII vector to prepare phage library. Then cDNA library was screened using the above-mentioned 
amplified fragment as a probe. Thirty-five of resulting positive plaques were selected randomly and converted to plas- 
mids, then physical mappping and partial sequencing were performed. As a result, they were clarified into 4 groups of 

*5 independent clones. 

[0020] Clones #1 , #6, #35 and #45 were representatives of each group having the longest lengths close to full length 
cDNAs, and base sequences of the clones were determined. Moreover, the deduced amino acid sequences encoded 
by the open reading frame regions of the base sequences were determined. The base sequences determined by 
sequencing were shown in Fig. 2. The base sequence of cDNA obtained on the clone #45 was shown in SEQ.ID. NO: 
2 in a Sequence List and in Fig. 2D. The region corresponding to open reading frame of said gene ranged from base 
numbers 32 to 1168, and the deduced amino acid sequence encoded by said region was shown in SEQ.ID. NO:1 in 
a Sequence List. Moreover, the base sequence of cDNA obtained on the clone #1 was shown in SEQ.ID. NO:4 in a 
Sequence List and in Fig. 2A. The region corresponding to open reading frame of said gene ranged from base numbers 
1 4 to 1 1 71 , and the deduced amino acid sequence encoded by said region was shown in SEQ. ID. NO:3 in a Sequence 
25 List. Furthermore, the base sequence of cDNA obtained on the clone #6 was shown in SEQ.ID. NO:6 in a Sequence 
List and in Fig. 2B. The region corresponding to open reading frame of said gene ranged from base numbers 44 to 
1201 , and the deduced amino acid sequence encoded by said region was shown in SEQ.ID. NO:5 in a Sequence List. 
Moreover, the base sequence of cDNA obtained on the clone #35 was shown in SEQ.ID. NO:8 in a Sequence List and 
in Fig. 2C. The region corresponding to open reading frame of said gene ranged from base numbers 45 to 1163, and 
30 the deduced amino acid sequence encoded by said region was shown in SEQ.ID. NO:7 in a Sequence List. In the 
following, the gene corresponds the clone #45 was designated to MXMT1 , the clone #1 was designated to MTL1 , the 
clone # 6 was designated to MTL2, and the clone #35 was designated to MTL3, respectively. 

[0021] The alignment compared among amino acid sequences encoded by MXMT1 , MTL1 , MTL2 and MTL3 was 
shown in Fig. 3. As a result, it was shown that these four sequences exhibit extremely high homology. To confirm the 
35 functions charge by these polypeptides, genes corresponding to each clone were expressed in E. coli to confirm their 
enzymatic activities. 

(Expression of GST fused protein) 



20 



40 



[0022] The open reading frame regions of MTL1 (Clone #1 ), MTL2 (Clone #6), MTL3 (Clone #35) and MXMT1 (Clone 
#45) were amplified by PCR (polymerase chain reaction). Then, they were optionally cloned into pGEX 4T-2 vector 
(Pharmacia) and E. coli (JM109) cells were transformed with the resulting plasmids. The obtained E. coli cells were 
cultured in LB liquid medium containing ampicillin. When OD600 of the culture reached to 0.5, IPTG (isopropyl thio- 
p-D-galactoside) was added to it and the final concentration of IPTG was made to 1 mM, then the mixture was further 
45 cultured at 16°C for 6 hours. E. coli was desrupted by a sonicator and the protein of the purpose was purified by 
glutathione Sepharose 4B as a GST (glutathione S-transf erase) fusion protein. Concentration of the protein was meas- 
ured by the Bradford method. Each of the GST fusion protein (500 ng) was separated by SDS-PAGE (sodium dodecyl 
sulfate polyacrylamide gel electrophoresis), then it was stained by CBB (coumasie Brilliant Blue) to confirm purification. 
The purities of the resulting GST fusion proteins were analyzed by SDS-PAGE and the results were shown in Fig. 4. 
50 in Fig. 4, lane 1 shows the result of MTL2 fusion protein, lane 2 shows the result of MTL3 fusion protein, lane 3 shows 
the result of MXMT1 fusion protein, respectively. As a result, the resulting three fusion proteins were shown to be 
approximately pure. 



55 



(Measurement of enzymatic activities by thin layer chromatography) 

[0023] Measurement of enzymatic activity was performed using thin layer chromatography (TLC), based on the meth- 
od of Kato et al. (Plant Physiol., 1996, 98, 629-636). In concrete, the reaction mixture of 100 ul, containing 100 mM 
Tris-HCI (pH 7.5), 200 uJvl substrate (xanthine, 7-methylxanthine, theobromine, paraxanthine, theophylline), 4 u,M 14 C- 
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labeled S-adenosylmethionine, 200 uM MgCI 2 , 200 ng GST fusion protein, was incubated at 27°C for 2 hours. After 
the reaction, the resulting mixture was extracted with 1 ml of chloroform, the chloroform layer was recovered, then 
chloroform was evaporated by speed back concentrator. The residue was dissolved in 5 u.l of 50% methanol solution, 
then the solution was developed by TLC (solvent for development was water:acetic acid:n-butanol= 2:1 :4, v/v/v). After 

5 the development, signal of radio activity was detected by image analyzer (Fuji BAS 2000). The result of enzymatic 
activity, which was measured on the fusion proteins derived from MTL2, MTL3 and MXMT1 using xanthine (X), 7-meth- 
ylxanthine (7-Mx), theobromine (Tb), paraxanthine (Px) and theophylline (Tp) as the substrate, was shown in Fig. 5. 
From Fig. 5, it was revealed that the fusion protein derived from MXMT1 exhibited potent activity to synthesize theo- 
bromine, using 7-methylxanthine as the substrate. The fusion protein derived from MXMT1 also exhibited activity to 

10 synthesize caffine, using paraxanthine as the substrate, but its relative activity was 1 5% of the above-mentioned activity. 
On the other hand, the fusion proteins derived from MTL2 and MTL3 did not exhibit activity as a methyl transferase, 
using the above-mentioned compounds as the substrate. 

(Enzymatic activity measurement and identification of the product by HPLC) 

15 

[0024] Using high performance liquid chromatography (HPLC), enzymatic activity of the MXMT1 fusion protein was 
measured and reaction product obtained from the enzymatic reaction was identified. The reaction mixture of 100 §xl, 
containing 100 mM Tris-HCI (pH 7.5), 200 u.M of substrate (7-methylxanthine, paraxanthine, theobromine), 50 of 
S-adenosylmelhionine, 200 u.M of MgCI 2 , 200 ng of GST fusion protein, was incubated at 27°C for 2 hours. After 

20 incubation, the mixture was extracted with 1 ml of chloroform, the chloroform layer was recovered, then chloroform 
was evaporated by a speed back concentrator. The residue was dissolved in 50 \i\ of 1 2% acetonitrile. Then the solution 
was fractionated by HPLC (Shodex Rspak DS-613 column) provided with UV detection system. As the solution for 
development, 12% acetonitrile was used and the signal was detected for absorbance of 254 nm. 
[0025] The result was shown in Fig. 6. The MXMT1 fusion protein was reacted with S-adcnosylmethionine and 

25 7-methylxanthine, which is the substrate and the reaction product was analyzed by HPLC. The chart exhibiting the 
result was shown in Fig. 6A. Moreover, theobromine was analyzed for a standard compound using HPLC and the chart 
exhibiting the result was shown in Fig. 6B. For preparation of negative standard, the MXMT1 fusion protein, S-adeno- 
sylmethionine and 7-methylxanthine was mixed and the reaction was immediately stopped and the chart exhibiting the 
result was shown in Fig. 6C. For standard products, 7-methylxantine, theobromine, paraxanthine and caffeine were 

30 analyzed by HPLC, and the chart exhibiting the result was shown in Fig. 6D. Furthermore, S-adenosylmethionine and 
7-methylxanthine was reacted with MXMT1 fusion protein and then theobromine was added to the reaction mixture. 
The chart exhibiting the result was shown in Fig. 6E. The peak position of the reaction product detected in Fig. 6A 
coincided with the position of theobromine, which was analyzed as the standard compound. In addition, when theo- 
bromine was added to the enzymatic reaction mixture, only one peak was observed. Therefore, it was shown that 

35 theobromine was formed by enzymatic reaction of the MXMT1 fusion protein, using 7-methylxantine as the substrate. 
[0026] According to the present invention, the polypeptide of theobromine synthase derived from coffea arabica and 
the gene encoding said polypeptide were provided. As theobromine synthase participates in biosynthesis of caffeine, 
caffeineless coffee would be obtained by preparing a transformed plant, wherein expression of gene encoding said 
enzyme was inhibited. 

40 
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50 
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Sequence Listing 

<1 10> President of Nara Institute Science and Technology 

<1 20> Theobromine synthase polypeptide of coffee plant and the gene encoding sai 

polypeptide 

<160> 8 

<210> 1 

<211>378 

<212> Amino acid 

<21 3> Caffea arabica 

<400> 1 

MELQEVLHMN EGEGDTSYAK NASYNLALAK VKPFLEQCIR ELLRANLPNI NKC I KVADLG 60 
CASGPNTLLT VRDIVQSIDK VGQEEKNELE RPTIQIFLND LFQNDFNSVF KLLPSFYRKL 120 
EKENGRKIGS CLISAMPGSF YGRLFPEESM HFLHSCYSVH WLSQVPSGLV IELGIGANKG 180 
SIYSSKGCRP PVQKAYLDQF 7KDFTTFLRI HSKELFSRGR MLLTC1CKVD EFDEPNPLDL. 2 40 
LDMAINDLIV EGLLEEEKLD SFNIPFFTPS AEEVKCIVEE EGSCEILYLE TFKAHYDAAF 3 00 
SIDDDYPVRS HEQIKAEYVA SLIRSVYEPI LASHFGEAIM PDLFHRLAKK AAKVLHMGKG 3 60 
CYNNLIISLA KKPEKSDV 378 

<210>2 

<211> 1298 

<21 2> Nucleic acid 

<2i3> Caffea arabica 

<400> 2 

AGCAGTCGCA ATTCGATTGT CCTGCATATG AATGGAGCTC CAAGAAGTCC TGCATATGAA 60 

TGAAGGTGAA GGCGATACAA GCTACGCCAA GAATGCATCC TACAATCTGG CTCTTGCCAA 120 

CGTGAAACCT TVCCTTCAAC AATGCATACG ACAATTCTTG CGGGCCAACT TGCCCAACAT 180 

CAACAAGTGC ATTAAAGTTG CGGATTTGGG ATGCGCTTCT GGACCAAACA CACTTTTAAC 240 

AGTGCGGGAC ATTGTGCAAA GTATTGACAA AGTTGGCCAG GAAGAGAAGA ATGAATTAGA 3 00 

ACGTCCCACC ATTCAGATTT TTCTGAATGA TCTTTTCCAA AATGATTTCA ATTCGGTTTT 3 60 

CAAGTTGCTG CCAAGCTTCT ACCGCAAACT CGAGAAAGAA AATGGACGCA AGATAGGATC 420 

GTGCCTAATA AGCGCAATGC CTGGCTCTTT CTACGGCAGA CTCTTCCCCG A GG A GTC C A T. 4 80 

GCATTTTTTG CACTCTTGTT ACAGTGTTCA TTGGTTATCT CAGGTTCCCA GCGGTTTGGT 54 0 
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GATTGAATTG GGGATTGGTG CAAACAAAGG GAGTATTTAC TCTTCCAAAG GATGTCGTCC 600 
GCCCGTCCAG AAGGCATATT TGGATCAATT TACGAAAGAT TTTACCACAT TTCTAAGGAT 660 
TCATTCGAAA GAGTTGTTTT CACGTGGCCG AATGCTCCTT ACCTGCATTT GTAAAGTAGA 720 
TGAATTCGAC GAACCGAATC CCCTAGACTT ACTTGACATG GCAATAAACG ACTTGATTGT 7 80 
TGAGGGACTT CTGGAGGAAG AAAAATTGGA TAGTTTCAAT ATTCCATTCT TTACACCTTC 840 
AGCAGAAGAA GTAAAGTGCA TAGTTGAGGA GGAAGGTTCT TGCGAAATTT TATATCTGGA 900 
GACTTTTAAG GCCCATTATG ATGCTGCCTT CTCTA7TGAT GATGATTACC CAGTAAGATC 960 
CCATGAACAA ATTAAAGCAG AGTATGTGGC ATCATTAATT AGATCAGTTT ACGAACCCAT 102 0 
CCTCGCAAGT CATTTTGGAG AAGCTATTAT GCCTGACTTA TTCCACAGGC TTGCGAAGCA 1080 
TGCAGCAAAG GTTCTCCACA TGGGCAAAGG CTGCTATAAT AATCTTATCA TTTCTCTCGC 114 0 
CAAAAAGCCA GAGAAGTCAG ACGTGTAAAA GTTTGTTTTT AGTTGGTTTT TGTGCCGTTG 12 00 
GGGGTCTTTC GGGTATTGTC GTTTTGTATT CGTAATAAAA GTGATGTGCA AGAATAAGAT 12 50 
ATTTAGTACA ATATTTTCAT AAAAAAAAAA AAAAAAAA 1298 

<2IO> 3 
<2ll>385 
<212> Amino acid 
<21 3> Caffea arabica 
<400> 3 

M2LQEVLHKN GGEGEASYAK NSSFNQLVLA KVKPVLEQCV RELLRANLPN INKCIKVADL 6 0 
GCASGPNTLL TVWDTVQSID KVKQEKXNEL ERPTTQVFLT DLFQNDFNSV FMLLPSFYRK 12C 
LSKEMGRKIG SCLIAAMPGS FHGRLFFEES MHFLHSSYSL QFLSQVPSGL VTEL.GI TANK 180 
RSIYSSKASP PPVQKAYLDQ FTKDFTTFLR MRSEELLSRG RMLLTCICKG DECDGPNTKD 24 0 
LLEMAINDLV AECRLGEEXL DSFNVPIYTA SVEEVKCMVE EEGSFEILYL QTFKLRYDAG 3 00 
FSIDDDCQVR SHSPVYSDEH ARAAHVASLI RSVYEPILAS HFGEAIIPDI FHRFATNAAK 360 
VI RLGKGFYN NLIISLAKKP EKSDI 385 

<210>4 
<2il> 1360 
<212> Nucleic acid 
<21 3> Caffea arabica 
<400> 4 

GTCCTGCATA TGAATGGAGC TCCAAGAAGT CCTGCATATG AATGGAGGCG AAGGCGAAGC 60 
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AAGCTACGCC AAGAATTCAT CCTTCAATCA ACTGGTTCTC GCCAAGGTGA AACCTGTCCT 120 
TGAACAATGC GTACGGGAAT TGTTGCGGGC CAACTTGCCC AACATCAACA AGTGCATTAA 180 
AGTTGCAGAT TTGGGATGCG CTTCCGGACC AAACACACTT TTAACCGTTT GGGACACTGT 24 0 
ACAAAGTATT GACAAAGTTA AGCAAGAAAT GAAGAATGAA TTAGAACGTC CCACCATTCA 3 00 
GGTTTTTCTG ACTGATCTTT TCCAAAATGA TTTCAATTCG GTTTTCATGC TGCTGCCAAG 360 
CTTCTACCGC AAACTTGAGA AAGAAAATGG ACGCAAAATA GGATCGTGCC TAAT AGCCGC 4 20 
AA TGCCTGGC TCTTTCCACG GCAGACTCTT CCCCGAGGkG TCCATGCATT TTTTACACTC 4 80 
TTC7TACAGT CTTCAGTiTT TATCCCAGGT TCCCAGCGGT TTGGTGACTG AACTGGGGAT 54 0 
CAGTGCGAAC AAAAGGAGCA TTTACTCTTC CAAAGCAAGT CCTCCGCCCG TCCAGAAGGC 6 00 
ATATTTGGAT CAATTTACGA AAGATTTTAC CACATTTTTA AGGATGCGTT CGGAAGAGTT 660 
GCTTTCACGT GGCCGAATGC TCCTTACTTG. CATTTGTAAA GGAGATGAAT GCGACGGCCC 720 
GAATACCATG GACTTACTTG AG ATGGC AAT AAACGACTTG GTTGCTGAGG GACGTCTGGG 780 
GGAAGAAAAA TTGGACAGTT TCAATGTTCC AATCTATACA GCTTCAGTAG AAGAAGTAAA 84 0 
GTGCATGGTT GAGGAGGAAG GTTCTTTTGA AATTTTATAC TTGCAGACTT TTAAGCTCCG. 900 
TTATGATGCT GGCTTCTCTA TTGATGATGA TTGCCAAGTA AGATCCCATT CCCCAGTATA 960 
CAGCGATGAA CATGCTAGAG CAGCGCATGT GGCATCATTA ATTAGATCAG TTTACGAACC 1020 
CATCCTAGCA AG7CATTTTG GAGAAGCTAT TATACCTGAC ATATTCCACA GGTTTGCGAC 1060 
GAATGCAGCA AAGGTTATCC GCTTGGGCAA AGGCTTCTAT AATAATCTTA TCATTTCTCT 114 0 
TGCCAAAAAA CCAGAGAAGT CAGACATATA AAAGCTTGTT TTTAGTTGGT . TTTTCTGTTA 1200 
TGGGTTGTTT TCTGATACGG GGAAAGGATT CAGTGCGGTT GGGGTTCTAT CCGAGTAT7G 12 60 
TACTTTTTAT ATTATTAGTT GGTGTATAAT TATTATGTTA CATTGTTATA TTCGTAATAA 132 0 
AAGTGACGTA CAAAAATAAA ATATTTTCAT AAAAAAAAAA 13 60 

<210>5 
<211>385 
<212> Amino acid 
<213> Caffca arabica 
<400> 5 

MELQEVLHMN GGEGDASYAK NSSFNQLVLA KVKPVLEQCV GELLRANLPN TNKCIKVADL 6 0 
GCASGPNTLL TVRDIVQSID KVRQEMKNEL ERPTIQVFLT DLFQKDFNSV FMLLPSFYRK 120 
LEXENGRKIG SCLIAAMPGS FHGRLFPEES MHFLHSSYSL QFLSQVPSGL VTELG1TANK 18 0 
RSIYSSKASP PPVQKAYLDQ FTKDFTTFLR IRSEELLSRG RMLLTCICKG DEFDGPNTMD 24 0 
LLEMA1NDLV VEGHIjEEEKL DSFNVPIYAA SVEELKCIVE EEGSFEILYL ETFKLP.YDAG 300 



EP1 197,558 A2 % 



20 



25 



FSIDDDCQVR SHSPEYSDEH ARAAHVASLL RSVYEPILAN HFGEAIIPDI FHRFATNAAK 3 60 
VTRLGKGFYN MLTISLAKKP EKSDI 3g5 

<210>6 
w <211>1304 

<212> Nucleic acid 

<213> Caffea arabica 
75 <400> 6 

TTTAGCAGTC CCAATTCGAT TTATGTACAA GTCCTGCATA TGAATGGAGC TCCAAGAAGT 6 0 
CCTGCATATG AATGGAGGCG AAGGCGATGC AAGCTACGCC AAGAATTCAT CCTTCAATCA 120 
ACTGGTTCTC GCCAAGGTGA AACCTGTCCT TGAACAATGC GTAGGGGAAT TGTTGCGGGC 180 
CAACTTGCCC AACATCAACA AGTGCATTAA AGTTGCGGAT TTGGGATGCG CTTCCGGACC 240 
AAACACACTT TTAACAGTTC GGGACATTGT ACAAAGTATT G AC AAAGT T A GGCAAGAAAT 3 00 
GAAGAATGAA TTAGAACGTC CCACCATTCA GGTTTTTCTG ACTGATCTTT TCCAAAATGA 3 60 
TTTCAATTCG GTTTTCATGT TGCTGCCAAG TTTCTACCGC AAACTTGAGA AAGAAAATGG 42 0 
3Q ACGCAAGATA GGATCGTGCC TAATAGCCGC AATGCCTGGC TCTTTCCACG GCAGACTCTT 480 

CCCCGAGGAG TCAATGCATT TTT7ACACTC TTCTTACAGT CTTCAATTTT TATCCCAGGT 54 0 
TCCCAGCGGT TTGGTGACTG AATTGGGGAT CACTGCGAAC AAAAGGAGCA TTTACTCTTC 60 0 
35 CAAAGCAAGT CCTCCGCCCG TCCAGAAGGC ATATTTGGAT C AATTT AC G A AAGATTTTAC 6 60 

C AC A TTTTT A AGGATTCGTT CGGAAGAGTT GCTTTCACGC GGCCGAATGC TCC TTAC TTG 72 0 
CATTTGCAAA GGAGATGAAT TCGACGGCCC GAATACCATG GACTTACTTG AGATGGCAAT 78 0 
AAACGACTTG GTTGTTGAGG GACATC TGGA GGAAGAAAAA TTGGACAGTT TCAATGTTCC 84 0 
AATCTATGCA GCTTCAGTAG AAGAATTAAA GTGCATAGTT GAGGAGGAAG GTTC TTTTGA 900 
AATTTTGTAC TTGGAGACTT TTAAGCTCCG TTATGATGCT GGCTTCTCTA TTGATGATGA 9 60 
TTGCCAAGTA AGATCCCATT CCCCAG.^ATA CAGCGATGAA CATGCTAGAG CAGCGCATGT 102 0 
GGCATCATTA CTTAGATCAG TTTACGAACC CATCCTCGCA AATCATTTTG GAGAAGCTAT 108 0 
TATACCTGAC ATATTCCACA GGTTTGCGAC GAATGCAGCA AAGGTTATCC GCTTGGGCAA 1140 
AGGCTTCTAT AATAA TCTTA TCATTTCTCT TGCCAAAAAA CCAGAGAAGT CAGACATATA 12 CO 
AAAGCTTGTT TATAGTTGGT TTTTGTGCTA TGGTTTGTTT TCTGATACGG GGAAAGGATT 12 60 
TAGTGCGGTT GGGGTTCAAA AAAAAAAAAA AAAAAAAAAA AAAA 1304 



40 



45 



50 



55 



10 

3CID: <EP 119755SA2_I_> 



EP1 197 558 A2 



<210>7 
<211>372 
<212> Amino acid 
<2l3> Caffea arabica 
<400> 7 

KELQEVLRMN GGEGDTCYAK NSAYNQLVLA KVKPVLEQCV RELLRANLPN INKCIKVADL 60 
GCASGPNTLL TVRDIVQSID KVGQEKKNEI, ERPTIQIFLN DLFPNDFNSV FKLL PSFYRK 120 
LEKEKGRKIG SCLIGAMPGS FYSRLFPEES MHFLHSCYCI, QWLSQVPSGL VTELGISTNK 18C 
GSIYSSXASR LPVQKAYLDQ FTKDFTTFLR I HSEELFSHG RMLLTCICKG VELD ARM A I D 24C 
LLEMAINDLV VEGHLEEEKL DSFNLPVYIP SAEEVKCIVE EEGSFEILYL ETFKVLYDAG 3 0C 
FSIDDBHTKA EYVASSVRAV YEPILASHFG EAITPDIFHR FAKHAAKVLP LGKGFYNNLI 350 
ISLAKKP3KS DV 372 

<210>8 
<211> 1316 
<212> Nucleic acid 
<213> Caffea arabica 
<400> 8 

CTTTGGCAGT CCCAATTTGA TTTATGTACA A GTCCTGC AT ATGAATGGAG CTCCAAGAAG 6 0 

TCCTGCGGAT GAATGGAGGC GAAGGCGATA CAAGCTACGC CAAGAATTCA GCCTACAATC 120 

AACTCGTTCT CGCCAACCTC AAA C CTGTC C TTCAACAATG CGTACCCGAA TTGTTGCGGG 18C 

CCAACTTGCC CAACATCAAC AAGTGCATTA AAGTTGC G G A TTTGGGATGC GCTTCTGGAC 24C 

CAAACACACT TTTAACAGTT CGGGACATTG TCCAAAGTAT TGACAAAGTT GGCCAGGAAA 3 0C 

AGAAGAATGA ATTAGAACGT CCCACCATTC AGATTTTTCT GAATGATCTT TTCCCAAATG 3 60 

ATTTCAATTC GGTTTTCAAG TTGCTGCCAA GCTTCTACCG CAAACTTGAG AAAGAAAATG 4 20 

GACGCAAAAT ACGATCGTCC CTAATAGGGC CAATGCCCGG CTCTTTCTAC AGCACACTCT 4 8C 

TCCCCGAGGA GTCCATGCAT TTTTTACACT CTTGTTACTG TCTTCAATGG TTATCTCAGG 540 

TTCCTAGCGG TTTGGTGACT GAATTGGGGA TCAGTACGAA CAAAGGGAGC ATTTACTCTT 6 00 

CCAAAGCAAG TCGTCTGCCC GTCCAGAAGG C AT ATTTGG A TCAATTTACG AAAGATTTTA 660 

CCACATTTCT AAGGATTCAT TCGGAA3AGT TGTTTTCACA TGGCCGAATG CTCCTTACTT 720 

GCATTTGTAA AGGAGTTGAA TTAGACGCCC GGAATGCCAT AGACTTACTT GAGAT3GCAA 7 80 

TAAACGACTT GGTTGTTGAG GGACATCTGG AGGAAGAAAA ATTGGATAGT TTCAATCTTC 84 0 

CAGTCTATAT ACCTTCAGCA GAAGAAGTAA AGTGCATAGT TGAGGAGGAA GGTTCTTTTG 9 00 
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AAATTTTATA CCTGGAGACT TTTAAGGTCC TTTACGATGC TGGCTTCTCT ATTGACGATG 960 
5 AACATATTAA AGCAGAGTAT GTTGCATCTT CCGTTAGAGC AGTTTACGAA CCCATCCTCG 1020 

CAAGTCATTT TGGAGAAGCT ATTATACCTG ACATATTCCA CAGGTTTGCG AAGCATGCAG 1080 
CAAAGGTTCT CCCCTTGGGC AAAGGCTTCT ATAATAATCT TATCATTTCT CTCGCCAAAA 1140 
W AGCCAGAGAA GTCAGACGTG TAAAAGTTTG TTTTTGTGTT GGGGAAAGGA ATAAGTGCCG 1200 

TTGGGGGTCT TTCGGGTATT GTGCTTTTTA TATTATATTG TTTTGTATCC GTAATAAAAG 12 60 
TGGTGTGTAA GAATAAGATA TTTGACATAT ATTATTTTCA AAAAAAAAAA AAAAAA 1316 



15 



Claims 

20 1 . A polypeptide consisting of an amino acid sequence of following (a) or(b): 

(a) an amino acid sequence defined by amino acid numbers from 1 to 378 shown in SEQ ID NO- 1 in a 
Sequence List, 

(b) an amino acid sequence in which a part of said amino acid sequence (a) is deleted or another amino acid 
sequence .s added to said amino acid sequence (a) or a part of amino acid sequence (a) is substituted with 

u!In 7 aC, t Sequen< r 6 ' the amino acid sequence (b) having the activity to biosynthesize theobromine 

using 7-methylxanthine as the substrate. 

' JuSlSS^ T' 31 '" 9 ° f ^ amin ° Sequence exhibitin S at least 9 °% ^ homology with an amino acid se- 
quence defined by amino acid numbers from 1 to 378 shown in SEQ ID NO: 1 in a Sequence List. 



25 



30 



35 



3. A gene encoding the polypeptide according to Claims 1 or 2. 

4. A gene consisting of a base sequence of following (c), (d) or (e): 



S 3 ^ Sequence de,ined b V base "ambers from 1 to 1298 shown in SEQ ID NO. 2 in a Sequence List 
(d a base sequence in which a part of base sequence (c) is deleted or another base sequence is added to 
said base sequence (c) or a part of base sequence (c) is substituted with another base sequence the base 
<o arthrs C ubs^e COdin9 3 POlyPeptide h3Ving the activity t0 biosynthesize theobromine using 7-methylxanthine 

(e) a base sequence that hybridizes with said base sequence (c) under stringent condition, the base sequence 
iubstra°e * P °' yPeptlde haVing the activit y t0 biosynthesize theobromine using 7-methylxanthine as the 

6 - t::xrz:iT:z expression of the 9ene accordin9 to ciaims 3 to 5 * »• ^ * ™* 

50 

7 " t^brominr ed ^ aCC ° rdin9 t0 ° ,aim 6 ' Wherein antisens * Qene method is utilized to inhibit biosynthesis of 

8. The transformed plant according to Claim 6, wherein said plant is selected from the group consisting of Coffea 
55 arabica, Coffea canephora, Coffea liberica and Coffea dewevrei. 

9. A seed obtained from the transformed plant according to Claims 6 or 8. 



XDCID:<EP 1197558A2 I > 



12 



EP 1 197 558 A2 

10. A transformed plant wherein gene according to Claims 3 to 5 is introduced in the plant to increase biosynthsis of 
theobromine. 

11. The transformed plant according to Claim 10, wherein said plant is selected from the group consisting of Coffea 
arabica, Coffea canephora, Coffea liberica and Coffea dewevrei. 

12. A seed obtained from the transformed plant according to Claims 10 or 11 . 

13. A method for production of a transformed plant in which biosynthesis of theobromine is inhibited in the plant by 
decreasing expression of the gene according to Claims 3 or 5. 

14. The method according to Claim 13, wherein antisense gene method is utilized to inhibit biosynthesis of theobro- 
mine. 

15. A method for production of a transformed plant in which biosynthesis of theobromine is enhanced in the plant by 
enhancing expression of the gene according to Claims 3 or 5. 
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FIG. 2 



GTCCTGCATA TGAATGGAGC TCCAACAACT CCTGCATATG AATGGAGCCG AAGCCGAAGC AAGCTACGCC AAGAATTCAT CCTTCAATCA 90 

ACTGGTTC7C GCCAAGGTGA AACCTGTCCT TGAACAATGC GTACGCGMT TGTTCCGCCC CAACTTGCCC AACATCAACA ACTGCATTAA 1M 

AGTTCCAGaT TTGGCATCCC CTTCCGCACC AAACACACTT TTAACCCT7T (KaCACTGT AOMCTATT GAC^GTTA A«£^L" 270 

GAAGAATGAA TTAGAACGTC CCACCATTCA GCTTTnCTG ACTCATOTT TCCAAAATGA TTTCAATTCC GTTTTCATGC TGCTGCCAAC 3« 

cttctaccgc aaacttgaga aagaaaatgg acgcaaaata ggatcgtgcc taataccccc aatgcctggc TCTTTCCACC GCACACTCTT 458 

CCCCUGCAC TCCATCCATT TTTTACACTC TTCTTACACT CTTCACTTTT TATCCCACCT TCCCACCCCT TTCGTGACTC AATTGGGGAT 548 

CACT CCCAA C AAAAGCAGCA TTTACTCTTC CAAAGCAACT CCTCCGCCCC TCCAGAAGGC ATATTTCUr CAATTTACGA AAGATTTTAC 630 

CACATTTTTA AGGATGCCTT CGGAACACTT GCTTTCACGT G6CCGAAT6C TCCTTACTTG CATTTCTAAA GGAGATGAAT GCGACGGCCC 728 

GAATACCATG GACTTACTTG AGATGGCAAT AAACGACTTG GTTGCTGAGG CACCTCTGGG ggaagaaaaa TTGGACAGTT KAATGTTCC SIC 

AATOATACA GCTTCAG7AG aagmgtaaa gtgcatgctt gaggaggaag cttcttttga aattttatac ttgcagactt T7AAGCTCCG 900 

TTATCATGCT GGCTTCTCTA TTGATGATGA TTGCCAACTA AGATCCCATT CCCCAGTAfA CAGCGATGAA CA7GCTAGAG CAGCGCATGT 990 

GCCATCATTa ATTAGATCAG TTTACGAACC CATCCTAGCA AGTCATTTTG GAGAAGCTAT TATACCTGAC ATATTCCACA GGTTTGCGAC 1W3 

GAA7GCA6CA AAGGTTATCC GCTTGGGCAA AGGCTTCTAT AATAATCTTA TCATTTCTCT TCCCAAAAAA CCAGACAACT CAGACATATA 1170 

AAAGCTTGTT TTTAGTTGGT TTTTCTCTtA TGCCTTCTTT TCTGATACGG GGAAACGATT CAGTGCGGTT GGGGTTCTAT CCGA6TATTG 1260 

T At i i i ii AT ATTATTAGTT GGTCTATaaT TATTaTGTTA CATTGTTATA TTCGTAATAA AAGTGACGTA CAAAAATAAA ATATTITCAT 1350 



B 

TTTAGCAGTC CCAATTCGAT TTATGTACAA GTCCTGCATA TGAATGGAGC TCCAACAACT CCTCCATATC AATCCACGCG AAGCCCATCC 90 

AAGCTaCGCC AAGAATTCAT CCTTCAATCA ACTGGTTCTC GCCAAGGTGA AACCTGTCCT TGAACAATGC GTAGGGCAAT 7GTTGCGCGC 166 

CAACTTGCCC AACATCAACA AGTGCATTAA AGTTGCGGAT TTCGGATGCG CTTCCGCACC AAACACACTT TTAACAGTTC GGGACATTGT 270 

ACaAAGTATT GACAAAGTTA GGCAAGAAAT GAAGAATGAA TTAGAACGTC CCACCATTCA CGTTTTTCTG ACTGATCTTT TCCAAAATGA 360 

TTTCMTTCG CTTTTCATGT TGCT4CCAAC TTTCT ACCGC AAACTTGAGA AACAAAATGG ACGCAAGaTA GGATCGTGCC TAATAGCCGC 450 

AATCCCTGGC TCTTTCCACC GCACACTCTT CCCCGAGGAC TCAATGCATT TTTTACACTC TTCTTACACT CTTCAATTTT TATCCCACCT S40 

TCCCACCCCT TTGGTGACTG AATTGCCCAT CACTGCGAAC AAAAGCAGCA TTTACTCTTC CAAAGCAACT CCTCCGCCCC TCCAGAAGGC 630 

ATATTTGCAT CAATTTACCA AACATTTTaC CACATTTTTA AGCATTCCTT CGGAACACTT CCTTTCACCC CCCCCAATCC TCCTTACTTG 720 

CATTTGCAAA GGAGATGAAT TCGACOGCCC GAATACCATG GACTTACTTG AGATGGCAAT AAACGACTTG GTTCTTGAGG GACATCTGG* 810 

GGAAGAAAAA TTGGACAGTT TCAATGTTCC AATCTATGCA GCTTCACTAC AAGAATTAAA CTGCATAGTT GAGGAGGAAG GTTCTTTTGA 900 

AATTTTGTAC TTGGACACTT TTAACCTCCC TTATCATGCT CCCTTCTCTA TTGATGATGA TTCCCAAGTA AGATCCCATT CCCCAGAATA 990 

CAGCGATGAA CATGCTACAC CACCCCATCT GCCATCATTA CTTACATCAG TTTACGAACC CATCCTCGCA AATCaTTTTC GaCAACCTAT 1080 

TATACaGAC ATATTCCACA GGTTTGCGAC GAATCCAGCA AAGGTTATCC GCTTGGCCAA AGGCTTCTAT AATAATCTTA TCATTTCTCT 1170 

TCCCAAAAAA CCAGACAACT CAGACATATA AAAGCTTGTT TATACTTGGT TTTTGTCCTA TCGTTTCTTT TCTGATACGG GGAAAGCATT 1260 

TACTGCGCTT GGGGTTCAAA AAA A AAAA AA A A AAAAAA A A AAAA 1304 



CTTTGGCAGT CCCAATTTGA TTTATGTACA AGTCCTGCaT ATGAATGCAG CTCCAACAAG TCCTGCGGAT CAATGCAGGC GAAGGCGATA 90 

CAAGCTACGC CAAGAATTCA GCCTaCAATC AACTOCTTCT CGCCAAGGTG AAACCTCTCC TTGAACAATG CCTACGGCAA TTCTTCCGGG 180 

CCAACTTGCC CAACATCAAC AAGTGCATTA AAGTTGCGGA TTTGGCATCC GCTTCTCGAC CAAACACACT TTTAACACTT CGGCACATTC 270 

TCCAAAGTAT T GACAAA GTT GGCCAGGAAA AGAAGAATGA AfTAGAACGT CCCaCCATTC AGATTTTTCT GAATGATCTT TTCCCAAATC 360 

ATTTCAATTC GGTTTTCAAG TTGCTGCCAA GCTTCTACCG CAAACTTGAG AAAGAAAATG GACGCAAAAT AGGATCCTGC CTAATAGGCG 4S0 

CAATGCCCGG CTCTTTCTAC AGCAGACTCT TCCCCGACCA GTCCATGCAT TTTTTACACT CTTCTTACTC TCTTCAATGG TTATCTCAGG S40 

TTCCTaGCGG TTTGGTGACT GAATTCCGCA TCAGTACGAA CAAAGCCAGC ATTTACTCTT CCaAAGCAAG TCGTCTGCCC GTCCAGAAGC 630 

CATATTTGGA TCAATTTACG AAAGATTTTA CCaCATTTCT AAGGATTCAT TCGGAAGAGT TCTTTTCACA TGGCCGAATG CTCCTTACTT 720 

GCATTTCTAA AGCACTTGAA TTACACCCCC GGAATCCCAT ACACTTACTT CACATCCCAA TaaACGACTT CCTTCTTCAC CCACATCTCC 110 

AGGAACAAAA ATTGGATaCT TTCAATCTTC CAGTCTATAT accttcagca gaagaactaa actccatact TGACGACGAA gcttcttttg 900 

AAATTTTATA cctggacact tttaaggtcc tttacgatgc tcccttctct attcacgatg aacatattaa agcacagtat cttccatctt 999 

CCCTTACAGC ACTTTACGAA CCCATCaCG CAAGTCATTT TCGAGAACCT ATTATACCTG ACATATTCCA CACCTTTGCG aaGCATCCAC 1080 

CAAAG GTTCT CCCCTTGGCC AAACGCTTCT ATAATAATCT TATCATTTCT CTCGCCAAAA ACCCAGA6AA GTCAGACCTG TAAAAGTTT5 1170 

TTTTTGTGTT GCCGAAACCA ATAASTGCCG TTG6GWTCT nCGGGTATT GTGCTTTTTA TATTATATTG TTTTGTATCC GTAATAAAAG 1260 
TGCTGTGTAA GAATAAGATA TTTGACATAT ATTATTTTCA AAAAAAAAAA AAAAAA 



o 

AGCACTCGCA ATTCCATTCT CaGCATATG AATGCAGCTC caacaactcc 
GAATGCATCC 7ACAATCTGG aCTTCCCAA GGTCAAACCT rTCCTTGAAC 
CAACAACTCC ATTAAAGTTG CGGATTTGGG ATGCCCTTCT GCACCAAACA 
AGTTGGCCAG GAaGACAACA ATGAATTaCA ACGTCCCACC ATTCACATTT 
CAACTTCCTG CCAACCTTCT ACCGC AAA C7 CGAGAAAGAA AATGCACCCA 
CTACGCCAGA aCTTCCCCG AGGAGTCCAT gcattttttg CACTCTTGTT 
CATTCAATTG CGGArrCGTC caaacaaacc gactatttac TCTTCCAAAG 
TACGAAAGAT TTTACCACAT TTCTaAGGAT TCATTCGAAA GAGTTGTTTT 
tgaattcgac GAACCGAATC CCCTaGACTT ACTTGACATG GCAATAAACG 
TaGTTTCAAT ATTCCATTCT TTACACCTTC ACCAGAAGAA GTAAAGTGCA 
GACTTTTAAC GCCCATTATC ATGCTGCCTT CTCTATTGAT GATGATTaCC 
ATCATTAATT AGATCACTTT acgaacccat cctcgcaact CATTTTCCAG 
TGCA GCAAAG GTTCT CCACA TGGGCAAAGG CTCCTArAAT AAfCTTATCA 
GTTTGTTTTT AGTTGCTTTT TGTGCCGTTG GCGGTCTTTC GCCTATTGTC 
ATTTAGTACA ATATTTTCAT AAAAAAAAAA AAAAAAAA 



TCCATATCAA TCAACCTCAA GCCGATACAA GCTaCGCCAA 90 

AA TGCAT ACG ACAAT7GTTG CGGGCCAACT TGCCCAACAT 180 

CACTTTTAAC ACTGCGGCAC ATTCTGCAAA GTATTGACAA 270 

TTCTGAATGA TCTTTTCCAA AATGATTTCA ATTCGGTTTT 360 

AGATAGCATC CTGCCTAATA ACCGCAATGC CTGGCT CTTT 4 SO 

ACAGTGTTCA TTGGTTATCT CAGGTTCCCA GCGGTTTGGT 540 

GAT GTCGTCC CCCCCTCCAC AAGGCATATT TCGATCAATT 630 

CACGTGGCCG AATGCTCCTT ACCTGCATTT GTAAAGTAGA 720 

acttcattgt tgagggactt CTGGAGGAAG AAAAATTGGA 810 

TICTTGAGCA CCAACCTTCT TGCGAAATTT TATATCTGCa 900 

CAGTAAGATC CCATCAACAA ATTAAACCAG AGTATCTCGC 990 

AACCTATTAT GCCTGACTTA TTCCACAGGC TTGCGAAGCA 1080 

TTTCTCTCGC CAAAAACCCA GAGAAGTCAG ACGTGTAAAA U70 

GTTTTffTATT CGTAATAAAA GTGATGTGCA AGAATAAGAT 1260 

1298 
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FIG. 3 

MXMTl MgJQEVI«fl*EUHJJlbYM^^ 49 

MTL1 :G: : :EAr : : : :S:F:Q:V: :::: ;V: :: :V: :::::::: ; 50 

MTL2 :G: : : : A: : : r:S:F:Q:V: : ; : : :V: : : :V3: 50 

MTL3 :::::: :R: :G: : :SA: :Q:V; ; : : : :V: ; : :V: : : : j r : : : : 50 

MXMT1 3*KOXraiXGC3VSGENI^ 99 

MTL1 I:::::::::::::;::;::: :W:T: :K: :M: iV: :T 1 00 

MTL2 ::::: t :::::: s : s : : R : : M: 1 ::::::: :V: :T 1 00 

MTL3 2 : i : J :::::::::::::::::: i s 2 ;Ks :::::::::::: : 1 00 

MXMT1 XX^FlS^rgVFKIX^ 1 49 

MTL1 :::::::::: ^1: A: :::: :Hi :::::: : ISO 

MTL2 :::::::::: :M: ;A: :::: :H: : 1 SO 

MTL3 : :=P: :::::::::: r: = ;;S: ::; r: : 150 

MXMT 1 MEgLH9CTSV»Ci30^93LV^ 1 99 

MTU : ; : : : :S::LQF: ::::::: :T: : : :T: : :R: : : : : ;ASP: 200 

MTL2 : : : ; : :S; :LCF; ::::::: :T: :: :T: : :R; ::: : :ASP: 200 

MTL3 :CLQ: : : ; : : r : : :T: : : :ST: ::::::: :AS:Ii: 2QO 

MXMT1 h 'IKUfrTi ^!JCEH5KErFSRGRMIjLlt!TQgVn tj> J j-^n u > jtt j n ^T NTT^ 249 

MTL1 ::::::::: :MR:E: :L: :;:•.:::=: : :G: ;C:G: :TK; : :E; ; ; : : :V 250 

MTL2 : j : ...... : :R:E: :L: : j ;G: : ; :G: :TM: ; :E: : : : : :V 250 

MTL3 ............ :E: : : sH . . . : . , . . . :GE:L:AR:AI: : :E: : : : : : V 250 
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